618 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English German Spanish
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Named Entity Recognition
-
Paper title:Sources of Transfer in Multilingual Named Entity Recognition
-
Paper track:Long/Information Extraction
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | David Mueller | CoNLL 2003 NER shared task corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Cantonese English French German Gishu Greek Gujarati Hebrew Hindi Indonesian Japanese Korean Mandarin Persian Portuguese Runyankore Russian Spanish Turkish Vietnamese
Availability:
Freely Available
License:
OpenSource
Size:
22.8 GByte Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:Speaking rate, information density, and information rate in first-language and second-language speech
-
Paper track:1.10 Bilingual and L2 acquisition and processing/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ann Bradlow | The ALLSSTAR Corpus | /N |
Documentation:
Documentation in English is available to the public (via the project website)
Speech
Corpus,
Language Type:
Monolingual
Languages:
German
Availability:
From Owner
License:
Size:
16462 sentences Production Status:
Newly created-finished
Use:
Sleepiness Recognition
-
Paper title:The INTERSPEECH 2019 Computational Paralinguistics Challenge: Styrian Dialects, Continuous Sleepiness, Baby Sounds & Orca Activity
-
Paper track:13.1 The Interspeech 2019 Computational Paralingui/Oral Presentation
-
Paper status:Accept Special Session
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Björn Schuller | SLEEP | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
German
Availability:
From Owner
License:
Size:
851 Diagnostic Reasoning Essays OtherProduction Status:
Newly created-finished
Use:
Epistemic Activity Segmentation and Classification
-
Paper title:Analysis of Automatic Annotation Suggestions for Hard Discourse-Level Tasks in Expert Domains
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Claudia Schulz | Diagnostic Reasoning Corpus annotated with Epistemic Activities | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English German Spanish
Availability:
From Owner
License:
Size:
10000 Onion Addresses OtherProduction Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:The Language of Legal and Illegal Activity on the Darknet
-
Paper track:Long/Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Daniel Hershcovich | DUTA-10K | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Croatian English German Italian
Availability:
Freely Available
License:
CreativeCommons
Size:
830 KByte Production Status:
Newly created-in progress
Use:
Textual Entailment and Paraphrasing
-
Paper title:Multilingual and Cross-Lingual Graded Lexical Entailment
-
Paper track:Long/Multilinguality
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ivan Vulić | Multilingual and cross'lingual graded-lexical entailment datasets | /N |
Documentation:
Documentation and annotation guidelines available in all 4 languages.
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Italian Spanish
Availability:
Freely Available
License:
Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Size:
~1000000 sentences Production Status:
Newly created-finished
Use:
Word Sense Disambiguation
-
Paper title:Just "OneSeC" for Producing Multilingual Sense-Annotated Data
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tommaso Pasini | OneSeC | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Japanese
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-updated
Use:
-
Paper title:Multi-Source Cross-Lingual Model Transfer: Learning What to Share
-
Paper track:Long/Multilinguality
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Xilun Chen | Cross-Lingual Sentiment Dataset | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
-
Paper title:Multi-Source Cross-Lingual Model Transfer: Learning What to Share
-
Paper track:Long/Multilinguality
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Xilun Chen | CoNLL 2003 shared task (NER) data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
German
Availability:
From Owner
License:
Size:
423 articles OtherProduction Status:
Newly created-in progress
Use:
Political Science
-
Paper title:Who Sides with Whom? Towards Computational Construction of Discourse Networks for Political Debates
-
Paper track:Short/Multidisciplinary and Area Chair COI
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sebastian Padó | MARDY corpus of political claims | /N |
Documentation:
None




